Goto

Collaborating Authors

 floorplan reconstruction


SLIBO-Net: Floorplan Reconstruction via Slicing Box Representation with Local Geometry Regularization

Neural Information Processing Systems

This paper focuses on improving the reconstruction of 2D floorplans from unstructured 3D point clouds. We identify opportunities for enhancement over the existing methods in three main areas: semantic quality, efficient representation, and local geometric details. To address these, we presents SLIBO-Net, an innovative approach to reconstructing 2D floorplans from unstructured 3D point clouds. We propose a novel transformer-based architecture that employs an efficient floorplan representation, providing improved room shape supervision and allowing for manageable token numbers. By incorporating geometric priors as a regularization mechanism and post-processing step, we enhance the capture of local geometric details. We also propose a scale-independent evaluation metric, correcting the discrepancy in error treatment between varying floorplan sizes. Our approach notably achieves a new state-of-the-art on the Structured3D dataset. The resultant floorplans exhibit enhanced semantic plausibility, substantially improving the overall quality and realism of the reconstructions. Our code and dataset are available online1.


Representation with Local Geometry Regularization Supplemental Material

Neural Information Processing Systems

We compare our method with four competing methods in Table 1 of the main paper. We also use the score reported by [5]. We found that the corner-based methods, e.g., HEA T and RoomFormer, fail to reconstruct the correct floorplans and are easily affected by the irregular Heat: Holistic edge attention transformer for structured reconstruction. Real-world perception for embodied agents.



SLIBO-Net: Floorplan Reconstruction via Slicing Box Representation with Local Geometry Regularization

Neural Information Processing Systems

This paper focuses on improving the reconstruction of 2D floorplans from unstructured 3D point clouds. We identify opportunities for enhancement over the existing methods in three main areas: semantic quality, efficient representation, and local geometric details. To address these, we presents SLIBO-Net, an innovative approach to reconstructing 2D floorplans from unstructured 3D point clouds. We propose a novel transformer-based architecture that employs an efficient floorplan representation, providing improved room shape supervision and allowing for manageable token numbers. By incorporating geometric priors as a regularization mechanism and post-processing step, we enhance the capture of local geometric details. We also propose a scale-independent evaluation metric, correcting the discrepancy in error treatment between varying floorplan sizes. Our approach notably achieves a new state-of-the-art on the Structure3D dataset. The resultant floorplans exhibit enhanced semantic plausibility, substantially improving the overall quality and realism of the reconstructions. Our code and dataset are available online.


CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction

arXiv.org Artificial Intelligence

We present CAGE (Continuity-Aware edGE) network, a robust framework for reconstructing vector floorplans directly from point-cloud density maps. Traditional corner-based polygon representations are highly sensitive to noise and incomplete observations, often resulting in fragmented or implausible layouts.Recent line grouping methods leverage structural cues to improve robustness but still struggle to recover fine geometric details. To address these limitations,we propose a native edge-centric formulation, modeling each wall segment as a directed, geometrically continuous edge. This representation enables inference of coherent floorplan structures, ensuring watertight, topologically valid room boundaries while improving robustness and reducing artifacts. Towards this design, we develop a dual-query transformer decoder that integrates perturbed and latent queries within a denoising framework, which not only stabilizes optimization but also accelerates convergence. Extensive experiments on Structured3D and SceneCAD show that CAGE achieves state-of-the-art performance, with F1 scores of 99.1% (rooms), 91.7% (corners), and 89.3% (angles). The method also demonstrates strong cross-dataset generalization, underscoring the efficacy of our architectural innovations. Code and pretrained models are available on our project page: https://github.com/ee-Liu/CAGE.git.



FloorSAM: SAM-Guided Floorplan Reconstruction with Semantic-Geometric Fusion

arXiv.org Artificial Intelligence

Abstract--Reconstructing building floor plans from point cloud data is a critical technology for indoor navigation, building information modeling (BIM), and highly accurate precise indoor measurement applications. Traditional methods, such as geometric algorithms and Mask R-CNN-based deep learning for mask segmentation, often suffer from sensitivity to noise, limited generalization, and loss of geometric details, severely impacting measurement accuracy. This study proposes an innovative framework, FloorSAM, that integrates room-height point cloud density maps with the guided segmentation capabilities of the Segment Anything Model (SAM) to enhance the precision of floor plan reconstruction from LiDAR point cloud data. By applying grid-based filtering to retain elevation point clouds near the ceiling of each region, combined with adaptive resolution projection and image enhancement techniques, a top-down density map is generated, improving the robustness and accuracy of spatial feature measurement. This framework leverages SAM's zero-shot learning to achieve high-fidelity room segmentation, remarkably enhancing reconstruction and measurement accuracy across diverse building layouts. Subsequently, leveraging SAM's zero-shot guided segmentation capabilities, high-quality room masks are generated based on adaptive prompt points, followed by a multistage filtering process to extract precise semantic masks for individual rooms. Through joint analysis of mask and point cloud modalities, contour extraction and regularization are performed, integrating semantic segmentation with geometric information to produce accurate room floor plans and recover topological relationships between rooms.


Floorplan-SLAM: A Real-Time, High-Accuracy, and Long-Term Multi-Session Point-Plane SLAM for Efficient Floorplan Reconstruction

arXiv.org Artificial Intelligence

Floorplan reconstruction provides structural priors essential for reliable indoor robot navigation and high-level scene understanding. However, existing approaches either require time-consuming offline processing with a complete map, or rely on expensive sensors and substantial computational resources. To address the problems, we propose Floorplan-SLAM, which incorporates floorplan reconstruction tightly into a multi-session SLAM system by seamlessly interacting with plane extraction, pose estimation, and back-end optimization, achieving real-time, high-accuracy, and long-term floorplan reconstruction using only a stereo camera. Specifically, we present a robust plane extraction algorithm that operates in a compact plane parameter space and leverages spatially complementary features to accurately detect planar structures, even in weakly textured scenes. Furthermore, we propose a floorplan reconstruction module tightly coupled with the SLAM system, which uses continuously optimized plane landmarks and poses to formulate and solve a novel optimization problem, thereby enabling real-time incremental floorplan reconstruction. Note that by leveraging the map merging capability of multi-session SLAM, our method supports long-term floorplan reconstruction across multiple sessions without redundant data collection. Experiments on the VECtor and the self-collected datasets indicate that Floorplan-SLAM significantly outperforms state-of-the-art methods in terms of plane extraction robustness, pose estimation accuracy, and floorplan reconstruction fidelity and speed, achieving real-time performance at 25-45 FPS without GPU acceleration, which reduces the floorplan reconstruction time for a 1000 square meters scene from over 10 hours to just 9.44 minutes.


SLIBO-Net: Floorplan Reconstruction via Slicing Box Representation with Local Geometry Regularization

Neural Information Processing Systems

This paper focuses on improving the reconstruction of 2D floorplans from unstructured 3D point clouds. We identify opportunities for enhancement over the existing methods in three main areas: semantic quality, efficient representation, and local geometric details. To address these, we presents SLIBO-Net, an innovative approach to reconstructing 2D floorplans from unstructured 3D point clouds. We propose a novel transformer-based architecture that employs an efficient floorplan representation, providing improved room shape supervision and allowing for manageable token numbers. By incorporating geometric priors as a regularization mechanism and post-processing step, we enhance the capture of local geometric details.